Explanation vs Performance in Data Mining: A Case Study with Predicting Runaway Projects

نویسندگان

  • Tim Menzies
  • Osamu Mizuno
  • Yasunari Takagi
  • Tohru Kikuno
چکیده

Often, the explanatory power of a learned model must be traded off against model performance. In the case of predicting runaway software projects, we show that the twin goals of high performance and good explanatory power are achievable after applying a variety of data mining techniques (discrimination, feature subset selection, rule covering algorithms). This result is a new high water mark in predicting runaway projects. Measured in terms of precision, this new model is as good as can be expected for our data. Other methods might out-perform our result (e.g. by generating a smaller, more explainable model) but no other method could out-perform the precision of our learned model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Three Decision-Making Models in Differentiating Five Types of Heart Disease: A Case Study in Ghaem Sub-Specialty Hospital

Introduction: cardiovascular diseases are becoming the main cause of mortality and morbidity in most countries. This research goal was to predict the types of heart diseases for more accurate diagnosis by data mining and neural network technics. Method: This research was an applied-survey study and after data preprocessing, three approaches of neural network, decision making tree and Bayes simp...

متن کامل

Comparison of Three Decision-Making Models in Differentiating Five Types of Heart Disease: A Case Study in Ghaem Sub-Specialty Hospital

Introduction: cardiovascular diseases are becoming the main cause of mortality and morbidity in most countries. This research goal was to predict the types of heart diseases for more accurate diagnosis by data mining and neural network technics. Method: This research was an applied-survey study and after data preprocessing, three approaches of neural network, decision making tree and Bayes simp...

متن کامل

Improving Performance of Mining Equipment Through Enhancement of Speed Factor: A Case Study (Research Note)

Loading and hauling machineries are highly capital intensive equipment to procure, operate and maintain in surface mining operation. It must be borne in mind that with this huge and capital-intensive equipment, every second of its life time is absolutely important from the production and productivity point of view. As such, it is imperative to optimize the overall cycle time and speed factor of...

متن کامل

Prioritizing Organizational Projects with Fuzzy Data Envelopment Analysis Approach (Case Study of Educational Research Plans)

The right choice of research projects is one of the most important effective ways to increase the productivity of research projects and allocate the resources appropriately. Training and education is one of the most important institutions for educating the students and prosperous people of the country. Due to the limited resources of this organization and, on the other hand, the expansion of sc...

متن کامل

An Empirical Evaluation of Predicting Runaway Software Projects Using Bayesian Classification

Since software development projects often fall into runaway situations, detecting signs of runaway status in early stage of development has become important. In this paper, we propose a new scheme for the prediction of runaway projects based on an empirical questionnaire. We first design a questionnaire from five viewpoints within the projects: requirements, estimations, planning, team organiza...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JSEA

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2009